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from the Sloan Digital Sky Survey (SDSS) and the other from a semi-analytic 



■ model (SAM) based on the Millennium Simulation (MS), to test to what extent 

current SAMs of galaxy formation are reproducing environmental effects. We 
estimate the large-scale environment of each galaxy using a Bayesian density 
estimator based on distances to all ten nearest neighbors and compare broad-band 
. photometric properties of the two samples as a function of environment. The 

^ feedbacks implemented in the semi-analytic model produce a qualitatively correct 

galaxy population with similar environmental dependence as that seen in SDSS 
. galaxies. In detail, however, the colors of MS galaxies exhibit an exaggerated 

dependence on environment: the field contains too many blue galaxies while 
^ . clusters contain too many red galaxies, compared to the SDSS sample. We also 

find that the MS contains a population of highly clustered, relatively faint red 
galaxies with velocity dispersions comparable to their Hubble flow. Such high- 
2 . density galaxies, if they exist, would be overlooked in any low-redshift survey 

since their membership to a cluster cannot be determined due to the "Fingers of 
O : God" effect. 

p 

Subject headings: galaxies: fundamental parameters — galaxies: luminosity 
^ . function, mass function — galaxies: statistics 

1. Introduction 



Since the discovery of the morphology-environment relation (lDresslerlll980l ). it has been 
known that galaxy properties are correlated with their large-scale environment: the average 
morphology, color and luminosity of galaxies differ depending on how crowded their neigh- 
borhood is. On the face of it, it is not clear why or how the environment of a galaxy on 
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Mpc scales should be related to the kpc-scale processes (star formation, supernova and AGN 
feedback) that determine the bulk properties of a galaxy. To further confuse ni atters, the 
strong correlation between morphology, color and luminosity (IStrateva et al.ll200ll . and refer- 
ences therein) makes it unclear which property is ultimately driven by environment let alone 
which physical processes are responsible. It is not even clear to what extent the environment 
of a galaxy effects it through nature (different formation conditions) rather than nurture 
(galaxy-galaxy interactions). A critical step towards answering such questions is to compare 
observed trends to those present in a simulated galaxy ensemble in which one knows all the 
processes at work. 



The Sloan Digital Sky Survey (SPSS. lAdelman-McCarthy &: et al.l 120071 ) is a powerful 
tool for addressing questions of environmental effects. Its spectroscopic sample of galaxies is 
the largest such sample ever, ensuring that even relatively rare galaxy populations are well 
represented, and the survey's large contiguous footprint makes it easy to determine the large- 
scale environment for most of these galaxies. Previous researchers who have used the SDSS 
galaxy catalog to study environmental dependences have found three broad trends: the peaks 
of the bimodal color distribution of galaxies do not shift for different environments; blue and 
red galaxies are most common in low- and high-de nsity environments, respectively; the lumi- 



nosity of red galax i es increases with local densitv (IHogg et a 
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The Millennium Simulation (MS, ISpringel et al.ll2005l ) is the largest ever cosmological 
simulation comprising some lO^'' dark matter (DM) particles with a spatial resolution of 
5hiQQ kpc. At z 



0, the simulation fills a cube dOOh^QQ Mpc per side. The MS does not 



explicitly model the gas, dust and stars which make up observable galaxies, but it produces 
a DM halo merger tree which serves as the backbone for a number of semi-analytic models 
(SAMs). Unlike N-Body/SPH simulations, SAMs do not simulate the gravitational and 
hydrodynamic forces involved in the formation and evolution of galaxies, but they do provide 
a computationally inexpensive way to explore the parameter space of sub-grid processes. The 
trade-off is that the parameters of a SAM must be tuned using observations (e.g. matching to 
the observed luminosity function), making truly independent comparisons between the model 
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De Lucia fcBlaizotI 120071 . and references therein). Their models differ — for example in their 
treatment of AGN feedback — but all reproduce some of the empirical features of galaxy 
populations. The MS galaxies have a very realistic distribution of luminosities, thanks to 
judicious use of "radio" feedba ck. They also exhibit a bimodal color distribution as discovered 
in SDSS (IStrateva et al.ll200ll ). Finally, the power spectrum of the density fluctuations is in 
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good agreement with the empirical data from 2dF and SDSS (ISpringel et aLll2005l ). Previous 
investigators have fo und that the brightes t MS galaxies are red, dead ellipticals populating 
rich galaxy clusters (jPe Lucia et al.ll2006l ). while the modeled galaxies in the very lowest- 
density environme nts have similar colors and star formation rates as analogous SDSS galaxies 
(IPatiri et al.lbooeh . 



In this work we compare the observed galaxy populations with those produced with 
SAMs. Our work differs from those listed above in the following ways: we use Bayesian 
number density as a proxy for local environment, rather than the commonly used surface 
density or two-point correlation function; we use the u — r color, which has more leverage 
than the g — r color; we use SDSS Data Release 5, rather than any of the previous (smaller) 
releases; and last but not least, we compare observed and modeled galaxies for the full range 
of galaxy environments and colors. 



Selection Criteria 



We use a sample of 674,749 galaxies from the SDSS Data Release 5 (DR5) main galaxy 
sample, an optical imaging and spectroscopic survey of galaxies over 1/4 of the sky (with 
limiting magnitude r < 17.7 after foreground extinction removal). We create a complete 
volume and luminosity-limited sample with 0.01 < z < 0.077 (or distances of 43-345 Mpc 
from the Milky Way) and Mr < —20, leaving 90, 689 galaxies. The cha racteristic galaxy 
luminosity in the SDSS r-band is M* = —20.60 (IBlanton et al.l l2003bl ). which falls well 
within our magnitude limit. We use model magnitudes corrected for foreground extinction 
but do not apply K corrections (we instead apply K corrections to th e model galaxies); 
absolute magnitudes are computed using ho = 0.732 (ISpergel et al.l 120071 ). As our sample of 
SDSS galaxies only extends to lookback times less than 1 Gyrs, it is representative of local 
galaxies. 



We use the modeled galaxies generated by De Lucia fcBlaizotI (120071 ) and available online 
through the Millennium Simulation database il The quantities we use are the Cartesian 
positions and velocities of the galaxies, as well as their absolute SDSS u and r magnitudes, 
which include dust extinction fr om both a diffuse IS M and attenuation of stars in young 
clusters in the emitting galaxy (IDe Lucia et al.ll2006l ). We make a < —20 cut on the 
z = snapshot, resulting in a complete sample of 1,805,780 galaxies in the simulation volume. 
We create a mock observational catalog for an observer at the origin by computing the right 
ascension, declination and distance modulus of each galaxy. Since the model galaxies have 
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rest-fr ame colors, we apply K corrections using the model spectra of iBruzual fc Chariot 



( 120031 ). although this changes the colors by less than 0.2 magnitudes. We then make cuts on 
radial distance, keeping only those galaxies which fall within the 43-345 Mpc range of our 
volume-limited SDSS sample. This mock "survey" covers one eighth of the sky and contains 
110,437 galaxies, ~ 20% more than our SDSS sample. 



3. Bayesian Density Estimator 



The most common proxy for environment is the num ber density of galaxie s, or the sur- 
face density of galaxies within redshift slic es. For exampl e . iBlanton et al.l (120051 ) use a depro- 
jected angular correlation function, while Scoville et al. ( 20071 ) use an adaptively smoothed 
surface-density. Other groups have used three-d imensional density estimators, s uch as over 
densi t y on a 8h~^ Mpc sc ale (IHogg et al.l 120031 ) or within a smoothing kernel (jPark et al. 



20071 ). iMateus et al.l (120071 ) use a hybrid of three-dimensional and two-dimensional 10th near- 
est neighbor density, noting that the former tends to under-estimate density in high-mass 
galaxy clusters. It is also common p ractice to use the t hree-dimensional galaxy correlation 
function as a metric for density (eg: IZehavi et al.l 120051 ). although it should be noted that 



the two-point correlation function is insensitive to higher order correlations which almost 
certainly exist between galaxies. 

We compute densities using three-dimensional positions. Rather than use the traditional 
tenth nearest neig hbor metric for number density, A^io = ^/d^Q, we use a Bayesian metric 
(llvezic et al1l2005h : 



n 



C- 



where = if computing the density at the location of a galaxy. The constant C = 11.48 is 
empirically determined by demanding that (n) matches actual number d ensity when density 
is estimated on a regular grid for a uniform density field. As shown in llvezic et al.l (120051 ). 
the use of distances to all ten neighbors, as opposed to only the tenth neighbor, results in a 
factor of ~ 2 improvement in the precision of density estimates. 

Although we are mostly interested in the density at the position of galaxies, the Bayesian 
density estimator can be used at arbitrary positions, allowing us to construct a density map 
on a regular grid. In Fig.[T]we show the density m ap (with resolutio n 1 Mpc) for an equatorial 
slice of DR5, centered on the Sloan Great Wall of iGott et al.l (l2005l ). Thanks to the precision 
of our density estimator, it is easily discernible that the "wall" is not a monolithic structure. 
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but results from the juxtaposition of a collection of large clusters of galaxie 



4. Observational Effects 

Three observational effects which plague the SDSS galaxy sample could significantly 
affect the density distribution of galaxies: incompleteness, edge effects, and "Fingers of 
God" . In this section we describe these effects and quantify how they influence the density 
distribution of SDSS galaxies. 

For galaxies falling within 55" of each other, only one galaxy gets a fiber, due to fiber 
collisions. Since 30% of the SDSS survey area consists of overlap regions between neigh- 
boring fields, the net effect of fiber collisions is a los s of 6% of the pho tometric galaxies 



that would otherwise be in the spectrocopic catalogue (jStrauss et al.ll2002l ). Fiber collisions 
notwithstanding, more than 95% of galaxies in the SDSS photometric catalogue are given 
a fiber and are in the spectroscopic catalogue. The bulk of the remaining 5% suffer from 
blending with satur ated stars and do not significantly bias the spectroscopic galaxy sample 
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The tili ng of SDSS fields is such that there are no gaps except near the edges of the 



survey area (iBlanton et al.ll2003al ) . The density estimated near the edge of the DR5 footprint 
will be artificially low because SDSS spectra have not been obtained for many of the true 
nearest neighbors. To remove the most egregious offenders, we do not compute densities 
for any galaxies with fewer than 10 neighbors within a 10 Mpc radius or for galaxies falling 
within 10 Mpc of our redshift limits. 

Galaxies in the SDSS spectroscopic catalog have small redshift uncertainties (30 km/ sec), 
but the true limiting factor for determining the radial distance to galaxies is the "Fingers 
of God" effect: massive galaxy clusters have large velocity dispersion, a, which has the ef- 
fect of smearing them out in redshift-space and reducing their apparent density by a factor 
l/(l + a/cz). 

To quantify the impact of observational effects, we compare the density distribution of 
our our mock survey of MS galaxies with that for the same survey but in which we model these 
effects. Fiber collisions are conservatively implemented by ignoring all neighbors within 55" 
of a galaxy when computing density (this affects 6% of the galaxies, in good agreement with 
the estimate for SDSS fiber collisions). General spectroscopic incompleteness is implemented 



^With the visualization shown in Fig. [T] this structure is more reminiscent of a mountain range than a 
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by removing 5% of the galaxies at random from the catalogue. To increase the surface area to 
volume ratio — and hence the importance of edge effects — we limit our samples to galaxies 
lying less than 10 Mpc from a survey edge (reducing the size of the sample by a factor ~ 5). 
We model Fingers of God by adding the peculiar velocity of each galaxy (obtained from the 
MS database) to its model Hubble flow, then computing its apparent radial distance from 
this mock redshift rather than from its actual Cartesian position. 

The density distributions for the mock MS survey with and without observational effects 
are shown as solid and dotted lines in the right panel of Fig. [21 The distribution remains 
unaffected except at high densities, where Fingers of God completely erase the high density 
tail (~ 3% of galaxies). If this extremely high density population of galaxies exists, it will 
only be detected in the next generation surveys operating at higher redshifts, where the 
Hubble flow dominates over peculiar velocities. For the remainder of the paper we compare 
our SDSS galaxy sample to the mock MS survey, including the effects of fiber collisions, 
spectroscopic incompleteness and Fingers of God. 



5. Environment and Photometry of Galaxies 

For the purposes of plotting our results, we remove outliers from both galaxy samples 
by cutting out the top and bottom percentile in color, as well as the top and bottom 0.1% 
in luminosity and density. The MS histograms are rescaled to the same total number of 
galaxies as for the SDSS. Fig.[2]shows the density distributions for the S PSS and MS galaxies , 



separated into the blue and red mode based on the u — r = 2.2 cut of IStrateva et al.l (120011 ) 
(For comparison, the peak of the density distribution for random positions in the survey is 
n = 10"^'^ Mpc~^ in either galaxy sample). There are 50% too many blue galaxies in the 
MS as compared to the SDSS, despite the fact that the minimum in color for both samples 
occurs at u — r = 2.2 (see Fig. H]). 

The luminosity function for both sets of galaxies, shown in Fig. [31 match very well 
except for the over-representation of blue galaxies in the MS. The luminosity function of the 
very dense (n > 1 Mpc~3) modeled galaxies — invisible in the mock survey — is shown with 
the dotted line in the left panel of Fig. [31 These extremely high density environments are 
populated by relatively faint (M^ > —21.5) red galaxies, not LRGs. The luminosity function 
for the lowest and highest density quartiles, shown in the right panel of Fig. [21 indicates that 
the SAM reproduces the environmental dependence of luminosity. 

Fig. [H shows the color distribution for the lowest and highest density quartiles of each 
sample. The SDSS and MS galaxy samples both exhibit a bimodal color distribution with 
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a minimum at m — r = 2.2, although the blue peak is too pronounced for MS galaxies. The 
peaks of the blue and red populations for the MS galaxies are approximately 0.2 magnitudes 
too blue, as compared to the SDSS galaxies. In both panels, the relative heights of the red 
and blue peaks change as a function of density. Red galaxies represent ~ 2/3 of the highest 
density quartiles of both the SDSS and MS samples. The environmental dependence of color 
is exaggerated for the lowest- density quartile in the MS, howev er: 79% are blue, compared 



to only 52% for the SDSS sample (see also iPatiri et al.ll2006l ). The right panel of Fig. H] 



indicates that the SAM fails to reproduce the dependance of color on luminosity, namely 
that brighter red galaxies are redder. 

All of the features and discrepancies described above can be qualitatively seen in Fig. [5l 
which combines the color, luminosity and density information for all the galaxies. Labeled 
white lines show which regions on the plot are most populated, while the color contours 
denote the median density of galaxies in a given color-magnitude bin. For galaxies less 
luminous than = —22, u — r color tracks density, while luminosity is independent of 
environment. For the brightest galaxies, however, density correlates with luminosity and not 
with color. 



6. Conclusions 



We have compared two analog ous galaxy samples, one fro m the SDSS DR5 spectroscopic 
sample, and one from the SAMs of iDe Lucia fcBlaizotI (120071 ). after correcting for the obser- 
vational effects present in the former. The density distribution and the luminosity function 
of the modeled galaxies qualitatively match those for the SDSS sample, but there are 50% 
too many blue galaxies in the former. In detail, two additional discrepancies become appar- 
ent between the galaxy samples: MS galaxies are more blue mu — r than SDSS galaxies; the 
colors of galaxies depend more strongly on environment in MS than in SDSS. The strong 
environmental dependence manifests itself as an over-representation of blue galaxies overall 
and suggests that the feedbacks implemented by iDe Lucia fcBlaizotI (120071 ) exaggerate the 
role of galaxy environment. A population of relatively faint red galaxies in extremely high 
density environments is visible in the MS survey sans observational effects. Such high den- 
sity environments would be imperceptible in SDSS due to velocity dispersions comparable 
to local Hubble flow. 
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Fig. 1. — A map of the Sloan Great W all, located on th e celestial equator some 350 Mpc 
away from the Milky Way (see Fig. 9 in lGott et al.ll2005l ). Due to the large distance to the 
purported structure, we applied a Mr < —21 cut to the DR5 galaxy catalog, yielding a sample 
of 129,974 galaxies complete to z = 0.12. The top panel shows the actual distribution of 
galaxies within ±7° of the equatorial plane. The bottom panel shows a Bayesian density map 
with spatial resolution of 1 Mpc with black areas corresponding to low-density environments, 
while red regions are the most massive clusters. This visualization resolves the wall into 
something more reminiscent of a mountain range. 
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Fig. 2. — Local number density distribution for SDSS and MS galaxies in the left and right 
panels, respectively. The blue and red lines represent the density distribution for blue and 
red galaxies based on a u — r = 2.2 cut. For reference, the dotted lines in the left panel show 
the distributions for blue and red MS galaxies (same as the solid lines in the right panel). 
The dotted lines in the right panel represents the density distribution of the MS before the 
application of observational effects. 
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Fig. 3. — Luminosity function for SDSS (solid) and MS (dotted) galaxies divided by color in 
the left panel and by density in the right panel. Red and blue lines in the left panel represent 
the luminosity functions for red and blue galaxies, based on a m — r = 2.2 cut. The dashed 
line in the left panel shows the luminosity function of the MS galaxies with n > 1 Mpc~^ 
before the application of observational effects. Green and magenta lines in the right panel 
represent the luminosity function for the lowest and highest quartiles in density (normalized 
at the faint end). 
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Fig. 4. — Color distribution for SDSS (solid) and MS (dotted) galaxies divided by density 
in the left panel and by luminosity in the right panel. Green and magenta lines in the left 
panel represent the color distribution for the lowest and highest quartiles in the density. 
Yellow and purple lines in the right panel represent the luminosity function for the lowest 
and highest quartiles in luminosity. 
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Fig. 5. — Color magnitude diagram for SDSS and MS galaxies in the left and right panels, 
respectively. Labeled white lines show which regions on the plot are most populated (these 
are complete volume-limted samples), while the color-coded background shows the median 
local environment around galaxies with a given color and magnitude (dark corresponds to 
low densities; bright corresponds to high densities). 



